geospatial dataset
TorchGeo: Deep Learning With Geospatial Data
Stewart, Adam J., Robinson, Caleb, Corley, Isaac A., Ortiz, Anthony, Ferres, Juan M. Lavista, Banerjee, Arindam
Remotely sensed geospatial data are critical for applications including precision agriculture, urban planning, disaster monitoring and response, and climate change research, among others. Deep learning methods are particularly promising for modeling many remote sensing tasks given the success of deep neural networks in similar computer vision tasks and the sheer volume of remotely sensed imagery available. However, the variance in data collection methods and handling of geospatial metadata make the application of deep learning methodology to remotely sensed data nontrivial. For example, satellite imagery often includes additional spectral bands beyond red, green, and blue and must be joined to other geospatial data sources that can have differing coordinate systems, bounds, and resolutions. To help realize the potential of deep learning for remote sensing applications, we introduce TorchGeo, a Python library for integrating geospatial data into the PyTorch deep learning ecosystem. TorchGeo provides data loaders for a variety of benchmark datasets, composable datasets for generic geospatial data sources, samplers for geospatial data, and transforms that work with multispectral imagery. TorchGeo is also the first library to provide pre-trained models for multispectral satellite imagery (e.g., models that use all bands from the Sentinel-2 satellites), allowing for advances in transfer learning on downstream remote sensing tasks with limited labeled data. We use TorchGeo to create reproducible benchmark results on existing datasets and benchmark our proposed method for preprocessing geospatial imagery on the fly. TorchGeo is open source and available on GitHub: https://github.com/microsoft/torchgeo.
Geospatial Reasoning with Shapefiles for Supporting Policy Decisions
Santos, Henrique, McCusker, James P., McGuinness, Deborah L.
Policies are authoritative assets that are present in multiple domains to support decision-making. They describe what actions are allowed or recommended when domain entities and their attributes satisfy certain criteria. It is common to find policies that contain geographical rules, including distance and containment relationships among named locations. These locations' polygons can often be found encoded in geospatial datasets. We present an approach to transform data from geospatial datasets into Linked Data using the OWL, PROV-O, and GeoSPARQL standards, and to leverage this representation to support automated ontology-based policy decisions. We applied our approach to location-sensitive radio spectrum policies to identify relationships between radio transmitters coordinates and policy-regulated regions in Census.gov datasets. Using a policy evaluation pipeline that mixes OWL reasoning and GeoSPARQL, our approach implements the relevant geospatial relationships, according to a set of requirements elicited by radio spectrum domain experts.
Machine Learning on Geospatial Datasets for Segmentation, Prediction and Modeling
The geospatial world is full of such datasets where its hard to know exactly how the input variables to your model will effect the outcomes. There exists a growing ecosystem of libraries and frameworks like Tensor Flow and Scikit-Learn that allow for sophisticated machine learning to take place but very few are easily interoperable with geospatial frameworks like PostgreSQL.. In this talk I will discuss ongoing work at CartoDB to integrate machine learning as a key analysis tool for geospatial data. Focusing on our work using random forests, neural networks and Markov chains I will talk about how these methods need to be adapted to work with geospatial data, how we can use the PL/Python extension in PostgreSQL to bring the power of these models to our geospatial data sets and discuss kinds of new analysis these methods open up In particular I will discuss about our work to develop segmentation models that are able to take a set of example observations and train a predictive model based underlying multivariate geospatial datasets like the census and use this model to predict new observations in regions where the original data was missing..